METHOD FOR ADAPTING AN INTERNET WEB SERVER TO SHORT-TERM 

CHANGES IN DEMAND 

FIELD OF THE INVENTION 

The present invention relates to the field of the Internet, and more particularly to tracking real- 
5 time characteristics of an Internet web server' s use and improving the performance of the server 
from a knowledge of these characteristics by altering its cache memory. 

BACKGROUND 

Internet users have come to expect nearly instantaneous response. Providing such 
responsiveness, particularly the responsiveness of web servers, depends on maintaining a 
10 favorable balance between resources and demands. 

Resources to provide responsive Internet service have improved remarkably in the recent past. 
Faster processors and memories are available to both the users' workstations and the web sites' 
servers. Faster communication links are provided by optical fiber backbone transmission, cable- 
modem access, and asymmetric digital subscriber loop (ADSL) services. 

1 5 Nevertheless, Internet responsiveness continues to be problematic, due to an ever-increasing 

burden that is placed on the Internet and on its web servers by an ever-increasing number of users 
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and the ever-increasing sophistication of their demands. Moreover, patterns of Internet use may 
shift dramatically over a short time span, further complicating the problem of maintaining the 
delicate balance between resources and demands. For example, a breaking news story may lead 
to an avalanche of demand for related information, or halftime at a one sports event may lead to a 
flurry of queries for scores of other sports events ongoing at the same time, and so forth. 

One way for a server to adapt to changing demand involves keeping and using logbooks that 
record past activities. Based on data kept in the logbooks, the server determines which web 
pages it should make readily available by storing in its cache memory rather than in its main 
memory. Unfortunately, logbooks of past activities are often large, cumbersome, and slow to 
adjust to short-term changes in web server demands. Consequently, when demand changes 
abruptly, the server is caught with the wrong pages stored in cache, and clients must endure delay 
while the server laboriously retrieves pages form main memory rather than from cache. 

Thus, in view of the shortcomings associated with the use of logbooks and the desirability of 
providing responsive web servers, there is a need for a way of tracking nearly instantaneous 
changes in demands on web sites, so that web servers may adapt their cache memories to short- 
term demand changes and reconfigure their resources quickly, in order to provide the most 
responsive services possible. 
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SUMMARY OF THE INVENTION 



The present invention provides a way in which a web site may adapt nearly instantaneously to 
changes in demand. According to one embodiment of the invention, servlets within the web 
server maintain state information concerning requests made by users of the server. The servlets 
associate each user with an HTTP session object. The HTTP session object is configured to 
include information that identifies the last-N web pages requested by the user's browser. 
Periodically, or in response to a triggering event, the server analyzes the contents of the HTTP 
session objects, for example by tabulating the frequency with which each web page has been 
requested in the recent past. From the results of the analysis, web-page caching priorities are 
determined, and the contents of the server's cache or the particulars of its caching algorithm are 
altered accordingly. 

Thus, with the present invention, the server may reconfigure its resources quickly in response to 
abrupt changes in demand. These and other aspects of the present invention will be more fully 
appreciated when considered in the light of the following detailed description and drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 shows a plurality of clients connected by the Internet to a web server. 



FIG. 2 shows aspects of the operation of the web server of FIG. 1 concerning the gathering of 
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session data. 



FIG. 3 shows aspects of the operation of the web server of FIG. 1 concerning the analysis of 
session data. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention, which enables an Internet web server to reconfigure its resources quickly 
in response to abrupt changes in demand, may be explained in the context of FIG. 1 . In FIG. 1, a 
plurality of clients 100A through 100N are connected to an Internet web server 130 by the 
Internet 120 or by another communication network. Each of the clients 100A through 100N 
includes a web browser 1 10A through 1 ION. The server 130 includes a plurality of servlets 
140A through 140M, which may be Java applets that run on the web server. The server 130 also 
includes a caching algorithm and a fast memory cache 150, which holds web pages that are 
expected to be accessed shortly, in order to provide the clients 100A through 100N with 
responsive service. 

The cache 150 has finite size, however, and not all web pages that might conceivably be 
requested by the browsers 1 10A through 1 10N will fit into the cache 150. Consequently, to 
provide responsive service within the constraint of finite size, only the web pages most likely to 
be requested soon again by the browsers 110A through HON are kept in the cache 150. 
However, the collection of web pages kept in the cache 150 must change when the demands of 
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the clients 100A through 100N change, and the browsers 1 10A through 1 ION begin to request 
web pages that are not then held in the cache 150. 

Session data may be kept and analyzed in order to detect changes in the demands of the clients 
100A through 100N that necessitate changes to the web pages kept in the cache 150. FIG. 2 

5 shows how session data may be kept, according to the present invention. The server 1 30 waits 
for a web page request from one of the browsers 1 10A through 1 ION (step 200). When a request 
is received, the server 130 determines whether an HTTP session object exists for the browser 
(step 210). HTTP session objects reside within memory of the server 130, and are associated 
with the browsers 1 10A through 1 10N by cookies, or, for browsers that do not support cookies, 

10 by URL rewriting. An HTTP session object may include a number of attributes, including a 
session ID that identifies the browser with which the session object is associated. More 
generally, the HTTP session objects enable session tracking, which is a mechanism through 
which the servlets 140A through 140M maintain information about the requests of the browsers 
1 10A through 1 10N. Ordinarily, sessions are shared among the servlets 140A through 140M that 

1 5 are accessed by the browsers 1 10A through 1 10N, although this is not a necessary condition of 
the present invention. 

As shown in FIG. 2, when an HTTP session object exists for a browser 1 10A through 1 10N that 
has requested a web page, the identification of the requested web page is written into the HTTP 
session object that is associated with the browser 1 10A through 1 10N that made the request (step 
20 220). In one embodiment of the invention, the identification of the requested web page is written 
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into a first-in-first-out queue with a capacity of five web page identifications, so that the last five 
web pages requested by a browser are included in the browser's HTTP session object. The server 
then awaits another web page request (step 200). 

Otherwise (i.e., an HTTP session object does not exist for the browser), the server creates an 
5 HTTP session object for the browser 1 10A through 1 ION that is requesting a web page (step 
230), writes the identification of the requested web page into the HTTP session object just 
created for the browser 1 10A through 1 ION that made the request (step 220), and awaits another 
web page request (step 200). 

As shown in FIG. 3, statistics of the identifications of the requested web pages are analyzed from 
10 time to time. Analysis may be undertaken periodically, for example upon the expiration of a 

timer or upon the arrival of a predetermined number of web page requests, or the analysis may be 
undertaken in response to a triggering event, for example a command from an operator, or an 
interrupt from an external source, or an abrupt change in the behavioral characteristics of the 
browsers 1 10A through 1 10N as evidenced by an abrupt change in the arrival rate of requests for 
15 web pages. The purpose of the analysis is to determine a set of caching priorities, so that the 
contents of the cache 150 or the operation of an associated caching algorithm may be altered in 
order to provide more responsive service. 



The server 130 determines as described above whether analysis is needed (step 300). The 
identifications of web pages requested by the browsers 1 10A through 1 10N are read from the 

RSW920010076US1 6 



HTTP session objects (step 310). From these identifications, caching priorities are computed 
(step 320). In one embodiment of the invention, the identifications of the requested web pages 
are ranked, for example from most-frequently requested to least-frequently requested, and 
caching priorities are assigned according to this ranking. In another embodiment of the 
5 invention, the identifications of the web pages are ranked from most-recently requested to least- 
recently requested, and caching priorities are assigned according to this ranking. Segments of the 
cache may then be re-loaded so that the cache contains the web pages that have the highest 
caching priorities, or the caching algorithm may be altered, for example by selecting a new 
caching algorithm (step 330). 

10 From the foregoing description, those skilled in the art will recognize that the present invention 
enables an Internet web server to track nearly instantaneous changes in demands and to adapt by 
quickly reconfiguring its resources, thereby to provide the most responsive services possible. 
The foregoing description is illustrative rather than limiting, however, and the invention is 
limited only by the following claims. 
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